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PLh ■ Let P be a probability distribution on M. (equipped with an Euclidean norm). Let r, s > 

^ and assume {a n ) n >i is an (asymptotically) L r (P)-optimal sequence of n-quantizers. In this 

paper we investigate the asymptotic behavior of the maximal radius sequence induced by the 
sequence (a n ) n >i and defined to be for every n > 1, p(a n ) = max{|a|, a <E a„}. We show 
that if card(supp(P)) is infinite, the maximal radius sequence goes to sup{|x|,x £ supp(P)} 
as ?i goes to infinity. We then give the rate of convergence for two classes of distributions with 
unbounded support : distributions with exponential tails and distributions with polynomial tails. 
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1 Introduction 
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oo 
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Quantization has become an important field of information theory since the early 1940's. Nowadays, 
q ■ it plays an important rule in Digital Signal Processing (DSP), the basis of many areas of technology, 

from mobile phones to modems and multimedia PCs. In DSP, quantization is the process of approxi- 
mating a continuous range of values or a very large set of discrete values by a relatively small set of 
discrete values. A common use of quantization is the conversion of a continuous signal into a digital 
signal. This is performed in analog-to-digital converters with a given quantization level. Beside these 
fields, quantization has recently become a domain of interest in Numerical Probability specially in nu- 
merical pricing of financial derivatives when their prices read as an expectation (or involve conditional 
expectations) of some random processes (see e.g. [T|). 

From a mathematical point of view, the L r -optimal quantization problem at level n for a Re- 
valued random vector X lying in L r (Q,A, P) consists in finding the best approximation of X by 
q(X), where q is a Borel function taking at most n values. This reads as the following minimization 
problem: 

e n , r (X) =mi{\\X-q(X)\\ r ,q :R d ^R d , card(g(M d )) < n}. 
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Note that in fact e n%r {X) only depends on the distribution P = Fx of X so that we will also use the 
notation e n>r (P) . However, for any Borel function q : M rf — > a we have 

\X-q(X)\ >mind(X,a) =d(X,a) = \X - X a \ F a.s 

so that the quantization problem reduces to 

e n>r (X) = mf{\\X -X a \\ r ,a C R d , card(a) < n} 

inf f/ d(x,a) r dP(x)^| 7 . (1.1) 

card(a)<n 

where X a = ^ae» a l{.YeC a (a)} an d (Ca(«))aGa corresponds to a Voronoi partition of R. d (with 
respect to a norm | • | on R d ), that is, a Borel partition of R. d satisfying for every a G a, 

C a (a) C {x G M d : \x - a\ = min \x - b\}. 

For every n > 1, the infimum in (jl.ip holds as a finite minimum reached (at least) at one grid 
a*. In this case a* is called an L r (P) -optimal (or L r -optimal for X) and a sequence of n-quantizers 
(a n )„>i is L r (P)-optimal if for every n > 1, a n is L r (P)-optimal. A sequence (a n )n>i is said 
asymptotically L r (P)-optimal if 

d(x, a n ) T P{dx) = e r n r (X) + o(e T n r {X)) as n — > cxd. 

Moreover the L r -quantization error e n)r (X) decreases to as n goes to infinity and if there is 
an (r + r/)-moment of X, for r/ > 0, the so-called Zador's theorem recalled below rules its rate of 
convergence to 0. 

Zador's Theorem (see [4]). Let P = P a + P s be the Lebesgue decomposition of P with respect to 
the Lebesgue measure A^, where P a denotes the absolutely continuous part and P s the singular part 
of P. Suppose E|X| r+r? < +oo for some -q > 0. Then 

lim n r / d (e n>r (P)) r = Q r {P). 

n— >+oo 

with 

d+r 

Qr(P) = Jr,d( [ f^d\ d ) " =J T , d \\f\\_i_ G[0,+DO), 
\JM.<1 J d+r 

J r4 = inf n r / d e r nr (U([0, l] d )) G (0,+oc), 

n>l 

where U([0, l] d ) denotes the uniform distribution on the set [0, l] d and / = jjj^. Note that the 
moment assumption : E|X| r+T? < +oo ensure that II /II a is finite. 

d+r 

Very little is known about the geometric properties of optimal quantizers. In this paper we address 
a first problem in this direction: we study the asymptotic behavior of the radii of a sequence (a n ) n >i 
of Z/-optimal quantizers. The maximal radius (or simply radius) p(a) of a quantizer a C M. d is 
defined by 

p(a) = max{|a|, a G a}. 
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In our framework, | • | will be an Euclidean norm on M. d . For the sake of simplicity, we will denote from 
now on by (p n )n>i the sequence (p(a n ))n>i of radii of a sequence (a n )n>i of optimal quantizers 
(although it may be not unique). 

We will show that, as soon as supp(P) is unbounded, lim p n = +00. Besides, our key inequal- 

n— >+oo 

ities to get the upper and lower estimates of the maximal radius sequence are provided in Theorem 
13.11 and Theorem 13.21 The first theorem yields amount others the maximal rate of convergence of 
F r ( Pn ) (when n — > +00) to 0, for every e > 0, with c r d = 1 if d = 1; r > 1 and c r d = 2 

otherwise. It claims that this rate is at most equals to n~( l+T l d \ 

Theorem [3T2] maintains in particular that for every u > 1, F r {p n u) goes to 0, as n goes to infinity, 

r-\-v 

at a rate at less equals to n d , where v is such that the random vector X has an (r, r + ^-distribution 
(see Definition l3.ll ). We will see later on that the index v\ ensuring that X has an (r, r+i/)-distribution 
for every v G (0,1/*) will play a crucial role in the lower limit estimates of the maximal radius 
sequence. 

Then we will emphasize how knowing the asymptotic behavior of the function — log F r allow to 
derive the asymptotic estimates of p n (or log p n ). As an important example we can already mention 
distributions with density function / satisfying 

f(x) cc ^fj^ 1{|»|>1} x G R d , P G R, c> r + d 
for which the optimal rate of convergence of log p n is computed and given by 

li m lQ g P' n - 1 r + d 
n— >+oo log(n) c — r — d d 

Of course, this result is less accurate as giving the rate of convergence of the sequence (p n ) itself 
for which the exact limit can not be computed with our approach because the upper and lower limits 
make appear no identified constants. Another example concerns distributions with exponential tail for 
which the upper and lower rates of convergence of the sequence (p n ) are provided. This is the case 
for the normally distributed random vector on R. d for which we have 



2 ±±^ < hminf < limsup^^ < 2J^P-. 

d v /l g(?i) ^/log(?i) V d 

Our general conjecture for such distributions, which is proved when d = 1 and r > 1, is that the 
liminf bound is sharp, that is, 

Um Pn _ . [W+d) 



^/\og{n) V d 

Moreover, an alternative approach is given for the lower limit estimates. This approach is based 
on random quantization and relies p n to the expectation of an i.i.d sequence of random variables 
distributed as X. 

The paper is organized as follows. In Section 2, upper and lower estimates of the maximal radius 
sequence are given and the exact limit is provided when the cardinal of the support of P is infinite. 
This limit corresponds to sup{| x\ , x G supp(P)} and Section 3 is entirely devoted to the convergence 
rate of the sequence of radii to this limit value. 
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Notations : Throughout the paper X will denote an Revalued random vector defined in the probability space 
(fl, A, P) with distribution P having a moment of order r > i.e. E\X\ r < +00. We define 

77+ (P) = \jL r+e (P). 

£>0 

We will denote by the Lebesgue measure on (M d ,S(K d )). We will also denote by F the survival 
function of X, that is, the (0, l]-valued function defined on K + by 

F : x i-> F(x) = F{{\X\ > x}) 

and for every r > 0, we define the generalized survival function of X by 

F r :x»F r (x)=E(\X\ r l {lx>x} ). 

Note that this last function is defined on M. + and takes values on the set (0, E|X| r ] . 

For a given set A, A will stand for its closure, d A its boundary, Conv(A) its convex hull and A or Int(A) 
its interior. The cardinal of A is denoted by card(A). For every x > 0, [x] will denote the integral part of x. 

2 Asymptotics of the the maximal radius sequence 

In this section we give an asymptotic upper bound and a lower bound of the sequence of radii. For 
distributions supported by a infinite set, the exact limit is provided. 

Proposition 2.1. Let X G L^ d (P). Let (a n ) n >i be a sequence of n-quantizers such that e njT .(X) — > 
as n — > +00. Then, 

liminfp n > sup{|x|, x E supp(P)}. (2.1) 



n— >+oo 



Remark that this result also holds for any norm on R. d . 
Proof. Let x G supp(P). Suppose that there exists eq > and a subsequence (p nfe )fc>i sucn that 

Vfc>l, Pnk <\x\-2e . (2.2) 

Thus 

3 r] > such that Vfe, d(B(0, p nk ), B(x, e )) > r] > 0. 
Then one has for every k > 1, 

e nk ,r( x ) = \\d(X,a nk )\\ r 

> \\d(X,B(0,p nk ))\\ r (since a nk C B(0,p nk )) 

> \\d(X,B(0,p nk ))l {x& B(x,eo)}\\r 

> \\d(B(x, £ ),B(0, Pn k ))l{XeB(x,s )} \\r 

= d(B(x,e ),B(0,p n J)F(X G ^(x^o)) 1 ^ 

> V F{X € Bix^o)) 1 ^ >0. 

This is not possible since e n r i^X^ — ► 0. Then, we have shown that 

Vx G supp(P), liminf/9 n > |x|. 

n 

Hence liminf/3 n > sup{|x|, x G supp(P)}. □ 
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Among other results, the next proposition provides the limit of the sequence (p n )n>i when the 
support of P is infinite. 

Proposition 2.2. (a) Let a be an L r -optimal quantizer at level n. 7/'card(supp(P)) > n then 



a C Conv(supp(P)) and p n < sup{|x|, x G supp(P)}. (2.3) 



(b) 7/'card(supp(P)) = +00 then 



lim p n = sup/9 n , = sup{|x|, x G supp(P)}. (2.4) 

n-»+oo n>1 



for any U (P) -optimal sequence of quantizers (a n )n>i- 

Proof, (a) If a is L r -optimal at level n then card(a) = n since card(supp(-P)) > n (see [9]). 



Now, suppose that a Conv(supp(i-')). Then let a G a D ( Conv(supp(P)) ) and set 

a' = {a\{a}) U {11(a)} 



where II denotes the projection on the non empty closed convex set Conv(supp(P)). The projection 
is 1-Lipschitz and X is P-a.s supp(i- > )-valued, hence 

d(X,a) > d(U{X),U(a)) F = s d(X,U(a)). 

It follows that 

d{X,a) > d(X,a') F-a.s. 
Since a is L r (P)-optimal at level n and card(a') < card(a) = n, 

E(d(X,a') r ) =K( y d(X,a) r ) 

so that the three statements hold: 

- d(X,a') = d(X,a) P-a.s 

- n(a) a\{a} since a' is L r (P)-optimal (which implies that card(o') = n), 

- F(X £ C n ( a )(a')) > (otherwise a'\{il(a)} would be optimal). 
On the other hand, X G Conv(supp(P)) P-a.s so that 

(a - U(a)\X - n(a)) < P-a.s. 

Consequently 

|X-a| 2 -|X-n(a)| 2 = 2(n(a)-a|X-n(a)) + |a-n(a)| 2 



> \a-U(a)\ 2 >0 since a Conv(supp(P)). 

As a consequence 



d(X,a) < d(X,a) P-a.s on {X G C U ( a )(a')} 

i(a) r' 



where Cu(a)( a> ) = {C £ n(a)) < <i(£, a\{a})} since the norm is Euclidean. 



This implies that P(X £ Cn(a)(a')) = and tnen G ^Cn(a)(«0) > °5 tnis is impossible 
since a' is //-optimal (see |4j). Hence a C Conv(supp(P)). 

Now, let us prove that p n < sup{|x|, x G supp(P)}. Note first that this assertion is obvious if 
supp(P) is unbounded. 

On the other hand if supp(P) is bounded then it is compact and so is Conv(supp(P)). Let 
xo G Conv(supp(P)) be such that \xq\ = swp{\x\,x G Conv(supp(P))}. Thus 

xo = A £i + (1 - A )6, 6,^esupp(P) 

and A i — ► | A£i + ( 1 — A)^2 1 is convex so that it reaches its maximum at A = or A = 1. Consequently 
x G supp(P). 

(b) This follows from the assertion about p(a n ) in the item (a) and from Proposition 12.11 □ 

Remark 2.1. If the norm on M. d is an arbitrary norm, the assertion (a) of the proposition may fail. 
An example is given with the loo-norm in p. 25. 



3 Convergence rate of the maximal radius sequence 

We first start by giving two examples of distributions for which the sharp convergence rate of the 
maximal radius sequence can be computed rather easily. In fact the semi-closed forms established in 
[7] for the U -optimal quantizers of the exponential and the Pareto distributions and summed up in 
the following proposition allow to derive some sharp asymptotics for the maximal radius sequence 
{Pn)n>i induced by the unique sequence of L r -optimal quantizers at level n. These rates will be very 
useful to validate the asymptotic rates obtained by others approaches. 

Proposition 3.1. (see /[7|/ ) (a) Let r > and let X be an exponentially distributed random variable 
with scale parameter A > 0. Then, for every ji ^ 1, the L r -optimal Quantizer oi n — (c^n,ii * " " > *^n,n) 
is unique and given by 

an,k = \\Y + ^ w ' 1 ~ k ~ n ' ^ 3 '^ 

where (afc)fc>i is an M + -valued sequence recursively defined by the following implicit equation: 

a := +oo, 4> r (-a k+ i) := r (a fc ), k > 

with (fr r ( x ) '■= Jo^ 2 |^| r_1 sign(n)e~ M (iu (convention : 0° = 1). 

Furthermore, the sequence (ak)k>i decreases to zero and for every k > 1, 

r + 1 / c r , 1 , 

"' = — ( 1 + ¥ + 0( F 

for some positive real constant c r . 

(b) Let r > and let X be a random variable having a Pareto distribution with index 7 > r. Let f 
be the density function : f(x) = jx~^ +1 ^l^ x> i^. Then for every n > 1, the L r -optimal quantizer 
a n = (a n .i, • • • , a nj n) is unique and given by 

^ n— 1 

"n,fc = 7- (1 + Oi), 1 < k < n, (3.2) 

1 + a ™ • 7,1 

i=n— fe+1 
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where (a k )k>i is an ^+-valued sequence recursively defined by the following implicit equation: 

a = +00, 7 ( - Qfc+1 ) := <f> y (a k ), k>l, 

with 4><y(x) := f^ 2 7|n| r_1 sign(n)(l + u)~^- 1+l ^du. The sequence {au)k>i decreases to zero and 
there is some positive real constant c such that for every k > 1, 

r + 1 / c 1 



(7 — r)fc \ k k 2 ' 
Let us give now the sharp asymptotic derived from these semi-closed forms. 

Proposition 3.2. (a) Let r > and let X be an exponentially distributed random variable with 
parameter A > 0. Then 

Pn = ^log(n) + ^ + <9(i), (3.3) 
where C r is a real constant depending only on r. 

(b) Let r > and let X be a random variable with Pareto distribution of index 7 such that 7 > r. 
Then, 

log(p n ) = I±il O g(n) + C r + 0(i), (3.4) 

7 — r 7r 

where C r is a real constant depending only on r. 
Proof, (a) It follows from fl53J that 

n-l 



i=l 



where the sequence (a n ) n >i decreases to zero and satisfies for every n > 1, a n = ((r + l)/n) (l + 
c r /n + 0(l/n 2 )), for some real constant c r . Thus, 



n— 1 -, n— 1 , re— 1 

3^ 



A/>n = y + (^l)£7 + C ,£^ + £0(lA 

= (r + l)log(n) + C r + 0(i). 

n 



(b) It follows from J32J) that 

n-l 



i=l 

where (a n ) n >i is an R + -valued sequence, decreasing to zero and satisfying : Vn > 1, 
1 + c r /n + 0(l/n 2 )), for some real constant c r . 



(7— r)n 

Then, 



n— 1 9 

a.- 



log(p n ) = - log(l + o„) + £ ( 0i - + 0(a?)) 



i=i 



= ^ilog(n)+a + 0(-) 

7 — r n 

where we used that X)Si af < 00 and °( a !) < 00 ■ 1=1 
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3.1 Upper estimate 

We investigate in this section the rate of convergence of (p n ) to infinity. Let us give first some defini- 
tions and some hypotheses which will be useful later on. 

Let (a n ) n >i be an L r (P)-optimal sequence of quantizers at level n. For n > 1, we define M(a n ) 
to be 

M(a n ) = {a G a n such that |o| = max|6|}. 

b£a n 

We will need the following assumption on P 

(H) = P(dx) > e l{ xG B(x ,r )}^d(dx), e , r > 0, x G M d . 

In the one dimensional setting, we will need the following specific assumption depending on r G 

[l,+oo) : 

(G r ) P = f • Xd where / is non-increasing to on [A, +oo) for some real constant A and 

lim f + °°(u - l) r ^^P^-du = 0. (3.5) 
y^+ooJi f{y) 

Let us make some brief comments on theses assumptions as well as some simple criterions. 

• Note that Assumption (H) holds as soon as X has a density / which is bounded away from 
on a closed ball B{xq, Tq), tq > 0, xq G M. d , i.e. eo := mm x£ §^ xo ro -j f(x) > 0. This is a very light 
assumption satisfied by all usual distributions (Gaussian distribution, the exponential distribution, the 
Pareto distribution, etc). 

• Assumption (G r ) holds for distributions with density functions of the form 

f{x) oc |x| c e"^ 1 " x G R; 0, k > 0; c> -1. 
Indeed, we have for large enough y, f is non-increasing and 

Ji f(y) Ji 

+oo 



< / (u- l) r ~ Ve-^^W < +oo. 



The existence of the last integral follows from the existence of the moment of every order. 

It follows from the Lebesgue convergence theorem that (|3.5j) holds. Then Assumption (G r ) holds 
in particular for the Gaussian distribution, for the Weibull distribution and for the Gamma distribution. 
However it fails for example for the Pareto distribution. But, we will see later that we do not need this 
assumption for distributions with polynomial tails to estimate the sequence (log p n )n>i- 

Let us recall the //-stationary property which will be also useful. Assume P = f • A^. The 
so-called L r -distorsion function D^ r : — > M + is defined by : 



a = • • • , a n ) i — ► E min \X — oti 

\ i=l, ••• ,n 

Then, for every r > 1, D^ r is differentiable at any codebook having pairwise distinct components 
and (see |6] for details) 

VDf»=r / (ai-u)\u-ai\ r - 2 f{u)du\ . (3.6) 



'Ci(a) 



Ki<n 
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An optimal //-quantizer at level n a = {ai, ■ ■ ■ , a n } for P has full size n, so that, 

VD* r (a) = 0. 

a is said to satisfy an //-stationary property. 

When d = 1 then for any (ordered) quantizer a n = {x^\ • • • , x^}, xf^ < • • • < Xn) at level 
n, its Voronoi partition is given by 

Ci(a n ) = (-00, a^], C n (a n ) = (x^\,+oo), Ci(a n ) = (^_i,a:H], i = 2, • • • , n - 1, 

2 2 2 l "r 2 

■ t , (n) x\ '+x\_\ ( n ) asj ; +a;- + i 

with x) \ = — — rr^— - and x. \ = — — o-^- 

1-5 ^ J+2 ^ 

The main result of this section is the following. 

Theorem 3.1. Suppose that X has an unbounded support and that (H) holds. Let {a n ) n >\ be an 
V (P)- optimal sequence of quantizers. Then, 



(a) 



lim liminf ( n 1+ 5p r ) > C^. (3.7) 

ej.o n->+oo y v 2 + e' 



:))>C 

(b)Ifd=l,r>l and if furthermore (G r ) /lo/fifs ?/2en, 

lim liminf ( n r+1 P r (-^M ] > C r 1 [/. (3.8) 

e jo y v i + e y y 

C r ,d,u is a positive real constant depending on r, d and the uniform distribution U on [0, l] d . 

The Lemmas below are used to prove this result. 

Lemma 3.1. Let X be an M. d valued random variable with unbounded support and probability distri- 
bution P and let {a n ) n >\ be an L r (P) -optimal sequence of n-quantizers, r > 0. Let (p n ) n >i be the 
maximal radius sequence induced by (a n )n>i- Then, 



(a) V e > 0, 3 n £ such that Vn > n, 



VaE M(a n ), V£G C a (a n ), |£| > (3.9) 

(b)Ifd=l,r>l and if furthermore (G r ) holds then, for large enough n, 

V a G M(a n ), V £ G C a (a n ), \^\ > J^-. (3.10) 

Proof, (a) Since (a n ) is L r (P)-optimal, e njT .(X) — > as n — >• +00. Then, the following asymptotic 
density property of (a n ) in the support of P holds: 

Ve > 0, Vx G supp(P) P(x, e) n a n + 0. (3. 1 1) 

Otherwise, if there exists x G supp(P), e > and a subsequence (a nfc )fc>i so that V/c > 1, B(x, e) n 
a nfc = 0, then, for every k > 1, 

e„ fc ,r(*) > ||d(X,a n Jl* eB(a , e/2) || r > ^(P^,^)) 1 ^ > 0. 
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Which contradicts the fact that e n>r (X) — > as n — > +00. 

Now, to prove the result assume first G supp(P). Let e > 0, a G M(a n ). Then, 3N\ G N 
such that 5(0, e)na n / 0, Vn > N\. Now p n -> +cx) implies that 5(0, e) n (a n \M(a n )) / for 
n > N[. 

Let 6 G B(x, e) n (a n \M(a n )). We have for every £ G C a (a n ), 

\C-b\ 2 >\ti-a\\ 

namely 

2(C|a-6) > |a| 2 -|5| 2 (> 0) 
> Pl-\b?. 

Now, |£||a - b\ > (£|a - b), then, 

\t\\a-b\>^ + Wb»-W. 

Moreover, \a — b\ < \a\ + |6| < p n + \b\. One finally gets 

it, > P» ~ N > Pn-e 
151 _ 2 - 2 • 

Since p n — > +00 as n — > +00, |£| > for large enough n. 

If supp(P) we show likewise that |£| > p "~^°^~ £ , Vxo G supp(P) which implies the 
announced result since p n — > +00. 

(b) We will make an abuse of notation by considering that 

Pn = Pn := rnax{x, x G a}. 

In what follows all results on /)_ (a) := max{— x, rr G a} can be derived by using —X instead of X. 
Let a n = {x^ , • • • , Xn j and suppose that (up to a subsequence) 



.r 



(n) 
n-1 



(n) 
n 



P <\. 



Let e > such that p + e < 1. We have for large enough n, 



x 



(n) 
n-1 



. < p + e < 1 

(n) r 



or equivalently, 

<4"M ' ' ^ ' " )• (3-12) 



+ „ (n) A + P + e 



2 

Let p' be such that < p' < 1 "^ +e \ that is, 

1 + p + e 



<l-p'<l. (3.13) 
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It follows from (|XT2|) and (fXHj) that 



^ / u y- 1 r l n) (i-p') / M y- 1 

j-W 1 ~ IR /(n)dn - 74TW) 1_ 30 /(n)dn 



x£°(l-f/) 



> (P') r_1 / Wfl+ + , 

r x ii (l+P+e) 
2 



with p" = (pO^Hl - P' - 4 £ ) > and cn G (^ n) (l + p + e)/2, ^ n) (l - pi)). 



-■2 r 2 
On the other hand, we have 



1 f °° ' u A f(u)du = / °V ly-'^^du. 



It follows from Assumption (G r ) that 



limsu P m 1 , I ( " 1 I /( n ) du = °- 



("-) IV / (V) I (n) 

n^+co x y n 'f(x\ J ) Jx^> \x y n ' 

Consequently we have for large enough n, 



/ \ r-1 

1 



f{u)du < p 



so that (A < c n < xi n ' ) for large enough n and / is non-increasing in [A, +oo)) 

+00/ \ r-1 f x (n) / \ r_1 

(n) I " 1 f(u)du < pfxtofizW) < p"x^f(c n ) < y,(n) i+ ,(n) ( 1 - -^J /(u)cfo 

which is not possible since the //-stationary equation implies 

fn) / \ r — 1 / \ r— 1 



We have then shown that 



(n) 

Iim 



n— >+oo „,( n ) 



X n-1 

It follows that Ve >, 3 n e such that Vra > n £ , affl < (1 + e)x^l 1 . Thus, 

p n = < (1 + e)x^l 1 < (1 + e)£ V e G C (a n ), a G M(a n ). 
This completes the proof. □ 
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Lemma 3.2. Let (a n )n>i be a sequence of L r -optimal n-quantizers of the distribution P. Suppose 
that (H) holds. Then for large enough n, 



where 



<W = *-" +d) (^7) TT^ ^(U(B(, , £)),. (3.15, 



Proof. Step 1. Let y G Without lost of generality we temporarily set 5 n = d(y, a n ). Following 
the lines of the proof of Theorem 2 in [5] we have for every x G B(y, S n /2) and a G a n , 

\x — a\ > \y — a\ — \x — a\ > 5 n /2 

and hence 

d( x >«n) > <W 2 > \x - y\, x G B(y, 5 n /2). 
It follows, by setting [5 n = a n U {?/}, that 

d(x,p n ) = \x-y\, x£B(y,5 n /2). 

Consequently for every b G (0, 1/2), 

e^ r (X) - e r n+ljr (X) > J d(x, a n ) r dP(x) - J d(x, f3 n ) r dP{x) 



> / (d(x,a n ) r -d{x,(5 n ) r )dP{x) 

JB{y,S n b) 

(d(x, a n ) r - \x - y\ r )dP[x) 

B(y,S n b) 

> [ {(5n/2) r - (5 n b) r )dP(x) 

JB{y,S n b) 

= (2- r -b r )5* n P(B(y,5 n b)). 

Step 2. Now, coming back to the core of our proof let xq and tq be as in (H). For every y G B(xq, ^) 
we have 

e r n AX) - e r n+ iA X ) > (2~ r -b r )6 r n P(B(y,(b5 n )A^)) 

> (2- - b r ) 6 r n e ((& 5 n ) d A l {yeB{ ^)y 

One checks that 

sup d(y,a n )^0. 

Otherwise By^ G B(xo, 77 > and a subsequence (a </ ,( n ))n>i of (a n )n>i such that for every 
n>l, d(y 00 ,a tp (n)) > \- Then 



d(a v(n) ,O r P(dt) > [ d( a ^ {n) ,O r P(dO- 
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Moreover for every £ £ B(y 0O ^) 

d(a^ n ),0 > d(y oc^O)) -£%»,£) > r ~ 7 

so that 

This contradicts the fact that e Ujr (X) —> as n goes to infinity. Consequently, for large enough n, 

sup d(y, a n ) < ^ 

so that 



e ; >r (Z) - e r n+1>r (X) > (2~ r - b r ) b d d(y,a n ) d+r e ^yeB^M)}- 



It follows that 



(e r n>r (X) - e r n+1>r (X))X d (dy) > (2~ r - b r ) e b d [ d(y, a n ) d+r \ d {dy) 

> (2~ r - b r ) e b d X d (B(x , ^)) e r n + d +d (U(B(x , ^))). 

Then, 

< r (X) - < +1 , r (X) > (2^ - b r ) so b d e r + d +d (U(B(x , ^))). 
Consequently, for large enough n, 

e r n>r (X) - e r n+1>r (X) > (2~ r - b r ) b d Q d+r (U(B(x , ^))) n^. 

1 + £q I 



As a function of b, the right hand side of this last inequality reaches its maximum at b* = \ ( 



l/r 

Which completes the proof. □ 

Proof of Theorem 13.71 Let a £ M(a„) and e > 0. We have, 

Epf - X "- 1 | r < E\X - X an \^ \ r 

and 

E\X-X°»\*\* = E(|X-X«»ri {XeC7 e K)} )+E( min \X - b\n {XeCa{an)} ) 

b£a n \{a} 

< E|X-X a "r'+E( min (|X| + |6|) r l { ^ eCo(an ) } ) . 

It follows from Lemma l3Tl (q) that 3 n £ £ N such that Vn > n £ , \X\ > on {X £ C a (a n )}. 
Consequently, for all 6 £ a n \{a}, \b\ < \a\ = p n < (2 + e) \X\. 
Hence, 

E\X - X a *-i\ r -E\X - X an \ r < (3 + e) r E(|JT| r l { | X | > ^_ } ). 



Lemma |3~2"1 yields (since (n — 1) d ~ n d as n — ► +oo) 



(1 + e^CV^t/ n" 1 ^ < (3 + e) r E(|X| r l { | X |>^ } ) 
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for large enough n, so that for every e > 0, 



n 



2 + e' J ~ {3 + e) r (l + e) 



Taking the limit as e — ► gives the statement (|3.7p . Assertion (|3.8p is proved as above by using 
Lemma [37X1 {b) instead of Lemma [37X1 (a). □ 

Recall that F r (x) = E(|X| r l||j S(: | >;c }). It is clear that this function is non-increasing and goes 
to as x — > +oo (provided E|X| r < +oo). Consequently, — log F r (x) is monotone nondecreasing 
and goes to +oo as x goes to +oo. Moreover, we know that if a function / defined on (0, +oo) is 
increasing to +oo (at +oo), its generalized inverse function f*~ defined by Vx > 0, 

r(x) = in£{t > 0, f(t) > x} (3.16) 

is monotone increasing to +oo. On the other hand, the following result holds (see 0): If furthermore 
/ is regularly varying (at +oo) with index 1/5,5 > 0, then there exists a function ip, regularly varying 
with index 5 and satisfying 

tfiffa)) ~ /(V^)) ~ x as i-> +oo. (3.17) 

The function tp is an asymptotic inverse of / and it is not necessarily increasing neither continuous. 
Moreover, ip is unique up to asymptotic equivalence and f*~ is one version of ip. 

We next show that for distributions with exponential tails, specifying either the asymptotic in- 
verse 4> r (if any) of the function — log F r or finding some asymptotic upper bound ip r of 4> r (having 
some " nice" properties) leads to an upper estimate of the maximal radius sequence. This estimate is 
connected to the chosen function ip r . 

When the distribution has a polynomial tail, we will look for the asymptotic inverse function of 
— log F r (e x ) or some asymptotic upper bound tp r of it to provide an upper estimate of (log p n )n>\- 

Proposition 3.3. Assume that the distribution P of X has an unbounded support and satisfies (H). 
Let (a n ) n >i be an L r (P)-optimal sequence of n-quantizers. 

(a) If ip r is a measurable nondecreasing function, regularly varying with index 5 and 

tpr(— log F r (x)) > x + o(x) as x — > +oo, (3.18) 

then 

lim sup Pn < 2 fl + . (3. 19) 



Vv(log(n)) d) 
Ifd = l and r > 1 and if(G T ) holds then, one has 



lim sup Pn < (r + if. (3.20) 

i/» r (log(n)) 



n— >+oo 



In particular if — log F r has regular variation with index 1/5 then l\3.I9\i holds with ip r = (— log F r )*~. 
(b) If ip r is a measurable nondecreasing function, regularly varying with index 5 and 

ip r (—\ogF r (e x ))>x + o(x) as x — ► +oo, (3.21) 

then 

lim sup -^L- <fl + ;V. (3.22) 



ip r (log(n)) V dJ 
If — log F r {e') has regular variation of index 1/5 then (jj. 22|) holds with ip r = (— log F r (e')) 
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Prior to the proof, let us make some comment on the proposition. First note that the measurability 
of ip r is necessary to define the regular varying property. On the other hand we have for every r > 
and for every x > 0, 

F T (x) > x r F(x). 

Then 

— logF r (x) < — logF(x) — rlog(x). 
According to the nondecreasing hypothesis on ip we have for every x > 1, 

Vv(-logF r (x)) < Vr(-log-F(x) -rlog(x)) < tp r (- logF(x)) (3.23) 

since log{x) > 0. Hence if Assumption (|3- 1 8 j) holds then 

1pr(— logF(x)) > X + o(x). 

We will see further on that for distributions with exponential tails, the function tp r in the statement (a) 
of the proposition does not depend on r. However in the situation of the item (6) of the proposition, 
Assumption (|3.21 j) implies that 

t/v(- log F(e x )) > (r + l)x + o(x). 

Consequently, taking F instead of F r in Assumption (|3.2ip will induce a loss of precision in the upper 
estimate of log p n . 

Also remark that if — logF r (resp. — logF r (e") ) is measurable, locally bounded and regu- 
larly varying with index 1/5,5 > then its generalized inverse function <fi r (resp. <I> r ) is mea- 
surable increasing to +oo, regularly varying with index 5 and, 4> r {— \ogF T {x)) = x + o(x) (resp. 
$ r (- \ogF r {e x )) = x + o(x)). Consequently, both inequalities ()3. 19|) and ()3.20j) (resp. claim ()3.22j) ) 
hold with 4> r (resp. 3> r ) in place of ip r . However, 4> r (resp. <£ r ) is in general not easy to compute 
and the examples below show that it is often easier to exhibit directly a function tp r satisfying the 
announced hypotheses without inducing any asymptotic loss of accuracy. 

We prove now the proposition. 

Proof, (a) It follows from (|3-7|) and (|3.8j) that for every e > 0, there is a positive real constant C r ^u,£ 
depending on the indexing parameters such that 

n d C r d,U,e < Fr{ 1 ) 

where (from now on) c rj i = 1 if r > 1; c r ^ = 2 otherwise. Therefore, one has 

^log(n)-log(a,^, £ ) > -logF r (-^-). 
Combining the fact that ip r is nondecreasing and Assumption ()3. 1 8[) yield 

^ r ( r -±^ \og(n) -log(C rAU)£ )) > ^ r (-\ogF r (-^)) 

> Pn , c +0(p n )- 

C r ,d + £ 
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Moreover, dividing by ^v(log(n)) (which is positive for large enough n) yields 

Pn . . 0(p n ),-lVr( r ^l0g(n) -log(C ridiC / i£ )) 

— t: — r^r < (CVd + e)(l H J — 7, — -r~\\ ■ 

It follows from the regular varying hypothesis on ip r and lim p n = +00 that 

n 

Pn ( f ^ \ ^ 

limsup " < (crd + e) — — , Ve > 0. 
n— +00 ^r(log(n)J \ d J 

The result follows by letting e — > 0. 

(b) As previously, one derives from (|3.7p and from Assumption (|3.21 j) and the nondecreasing hy- 
pothesis on ip r that 

^L±^log(n)-log(C rAU>£ )) > ^ r {-\ogF r {-^—)) 

Q Cr,c£ + £ 

> log p„ - log(c M + e) + o(log p n ). 

It follows that 

logPn w„ . , ^ „n„„„ nn-i.,. rr±d 



^ r (log(n)) 



< 



l log(c M + g) | o(logp„) \ MrjT lQ g( n ) - 1 Og(Cr,d,t/, E )) 
log/?„ logPn / Vr(log(n)) 



Owing to the regular varying hypothesis on ip r and the fact that lim p n = +00, we have 



limsup— — — < — — 

n-,+00 ipr{LOg{n)) \ d 



□ 



We next give an explicit asymptotic upper bound for the convergence rate of the maximal radius 
sequence in the sense that the function ip r is made explicit. These bounds are derived on the rate of 
decay of the generalized survival function F r . 

Criterion 3.1. (a) Let X be a random variable with unbounded support. Let r > and let (a n ) n >i 
be an L r -optimal sequence of n- quantizers for X. Let k > such that e^ x ^ 6 L 0+ (P). Set 

9* = sup {9 > 0, limsup e 6xK F r (x) < +00} = sup {9 > 0, E e 9|x|K < +00}. (3.24) 

x— >+oo 

Then 9* £ (0, +00] and 

limsup —f=-^< 2 ( r 4^) llK - (3-25) 
When d = 1 ant/ r > 1, if(G r ) holds then 



lim sup 7T- < 

'log(n)) /K 



Pn , ( r + 1 N 



n— >+oo 



(A) Le? X £ L r+ (P) be a random variable with unbounded support. Set 

(* = sup{( > 0, limsup x c_r F r (x) < +00} = sup{( > r, E|X| C < +00}. (3.26) 
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Then (* E (r, +00] and 

log p n 1 r + d 
hmsup - — ^- < — — . (3.27) 

Prior to the proof we can make the following remark. 
Remark 3.1. If X € f| L r (P) then C* = +00 and consequently lim = 0. 

Proof . (a) The equalities in (|3.24|) and (|3.26|) are obvious. 
Let 9 G (0,6>*). We have 

E(|xri {pC | >a!} ) =E{\xn {eemK>e9xK} ) < e -^E(|xr e w). 

Now, the right hand side of this last inequality is finite because if 9' £ (9, 9*), 

\~.\r„6\x\ K ^- -t 1 r< s,9'\ x \ K 

\x\ e 1 s 1 + LsQg/e 1 1 . 

As a consequence, 



logF r (x) > 9x K + C x , C X £ 

nction of y, i 
regularly varying with index 5 = \ and we have 



Let tpg(y) = (f) • As a function of y, i\)q is continuous (then measurable) increasing to +00, 



Cx \ ^ K 

-06»(- logF r (2;)) > ( x K + — = x + o(x), asx^+oo. 



It follows from Proposition 13.31 (a) that 

lim sup /K < c M (^) 1A V* G (0, **)• 

( log(n)) 7 \ ao / 

Letting 9 —> 9* completes the proof, 
(b) Let ( G (r,C*). We have 

E (l^r i {|x|> i r}) = E (l^ri{i< a; -c+'-|x|C-'-}) 
< x^ +r E\X\(. 

Then 

- log F r (x) > (C - r ) log(x) + C 
so that by setting ^v(ai) = it follows from Proposition [33] (6) that 

log/3 n 1 r + d 

hmsup - — -- < — . 

n-H-oo log(n) Q-r d 

Letting ( go to yields the assertion (|3.27p . □ 

Remark 3.2. Note that the choice of the function ip r in the statement (a) of Proposition !3.3l does not 
depend on r as approved in the proof of the item (a) of the previous criterion. But for distributions 
with polynomial tails the choice of Vv clearly depends on r. 
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We now give more explicit results for specified density functions. 
Corollary 3.1. (a) Suppose that the density f of X satisfies 

f(x) oc \x\ c e'^ x G M. d ; ■&, k > 0; c> -d. (3.28) 

Then 9* = •& and 

limsup r-r- < —rr 1 + - . (3.29) 

;iog(n)) 1/K " dJ 



n—f+oo 



(b) If the density of X reads 



f(x) cc fl°gM) ln_| >u iGK d ,/56l,c>r + d (3.30) 
then C,* = c — d and 

limsup - — < — . (3.31) 

n ->+oo log(n) c - d-r d 

(c) The statement l\3.29\\ (resp §3.3 1$ ) holds if the density of X is simply equivalent to the specified 
density in <\3. 28\\ ( resp in <\3. 30\i j. 

Notice that the restriction c > r + d in (|3.30p ensures that E|X| r < +oo. 
Proof, (a) We have 

F r (x)=E(\X\ r l m>x} ) = K [ \u\ r+c e^\<d\ d {u) 

J{\u\>x} 
r+oo 

= K Vd j p^+d- 1 e-^dp 



where K = J \x\ c e~^ x ^ d\d(x) is the normalizing positive real constant in ()3.28p and Vd denotes 
the volume of the Euclidean unit ball. Integrating by parts and using usual integral comparison rules 
yields 

F r (x) = C dAK x ^+ d - e"^ K (l + (1)) 
for a positive real constant Cd $ K - It follows that if 6 < $ then 



c 9x K 

x— >+oo 



limsup e F r (x) < +oo 

and if 8 > $ then 



limsup e F r (x) = +oo. 

x— >+oo 

Which means that 9* = and the statement (|3.29p follows from Criterion 13. II (a). 
(b) We have for every r > 0, for every x > 1, 

- f QogHf f +oc (logp)^ 

J{\u\>x} M Jx p 
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with d = c — r — d + 1 (d > 1) and Vd denned as previously. Integrating by parts and multiplying 
by x^~ r yields 

x<- r F r (x) = KV *. log(x)^- c+d (l + o(l)). 
c — (r + a) 

Consequently, £* = c — d. The statement f)3.3 1 1) follows from Criterion [XT] (6). 

(c) Obvious from the forgoing. □ 

We now give some examples for usual distributions. 

Example 3.1. (a) • If X ~ AA(0; I d ), we have 

f(x) = {27T)- d / 2 e-^\ 

It follows from the item (a) of the previous corollary (with i9 = 1/2, k = 2, c = 0) that for every 
r > 0, for every d > 1, 

limsup Pn < 2W2 (l + ~ 

n.-»+oo y'log(n) V ^ d 



In particular, when r > l,d= 1 we have lim sup , p " < \j2(r + 1). 
• For a double Gamma distribution in the real line where 

f(x) = -^-\x\ a - 1 e- x W, i S K; A, a > 
21 (aj 

or a Gamma distribution for which 

f(x) = £- x °-i e -**l {x>0} , X,a>0 

r(a) 

we have (from Corollary 13. II (a) with c = a — l,i? = A, k = 1) for every r £ (0, 1), 

p n . 2(r + 1) 
hmsup- — -- < 

n~>+oo log(n) A 



and in case r > 1 we have 



,■ p n r+l 

limsup - — -— < 



n^+oo log(n) A 
which coincides with the sharp rate given in (|3.3p • 

• When X has a logistic distribution with density f(x) = ^ e - x ^ we have 

f(x) ~ e~ x as x — > +oo. 

Then, it follows from Corollary 13. 11 (c) that (p n ) n >i has the same upper asymptotic as the exponential 
distribution with parameter A = 1. 

• As concern the Weibull distribution with shape parameter k, > with density function 

f(x) = KX K - l e- xK l {x > 0} 
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it follows from Corollary EH (a) (with i? = 1) that for r £ (0, 1), 

limsup < 2(r + l) 1 /* 

n-»+oo (log(n)) /K 

and if r > 1 

limsup P " < (r + I) 1 /*. 

n^+oo (log(n)) 

(b) Suppose X is a random variable having a Pareto distribution with index 7 > r. The density 
function reads 

f(x) = 7X -^ +1 h {x>1} . 

Then we deduce from Corollary 13. II (b) (with d=l, c = 7 + l,/3 = 0) that 

logp„ r + 1 

limsup - — ^— < . 

log(n) 7-r 



3.2 Lower estimate 

In this section we investigate the asymptotic lower estimate of the maximal radius sequence (p n )n>i 
induced by an //-optimal sequence of n-quantizers. First we introduce the family of the (r, s)- 
distributions which will play a crucial role to obtain the optimal lower estimate for the rate of the 
maximal radius sequence. 

Let r > 0, s > r. Since the //-norm is increasing, it is clear that for every s < r any //-optimal 
sequence of quantizers (a n ) n >i is //-rate optimal i.e. 

limsup n 1/d \\X - X an \\ s < +00. (3.32) 

n— >+oo 

But if s > r (and X £ L S (P)) this asymptotic rate optimality usually fails . So is always the case 
when s > r + d and X has a probability distribution / satisfying \d(f > 0) = +00, see |5]. But it is 
established in [TTj that some linear transformation of the //-optimal quantizers (a n ) makes possible 
to overcome the critical exponent r + d, that is, one can always construct an L s -rate-optimal sequence 
of quantizers by a linear transformation of the //-optimal sequence of quantizers (a n ). However 
there is some distributions for which (|3.32p holds for every s £ [r,r + d). This leads to the following 
definition: 

Definition 3.1. Let s,r > 0, s > r. A random vector X £ L S (P) has an (r, s)-distribution if any 
U -optimal sequence {a n ) n >i is L s -rate optimal. 

For v £ (0, d), sufficient conditions such that X has an (r, r + ^-distribution are provided in [5]. 
Let us mention two criterions ensuring that a random vector X has an (r, r + ^-distribution. The first 
one deals with distributions with radial tails. 

Criterion 3.2. (a) Let d > 1. Iff = h(\ ■ \) on B\.\(Q,N) C with h : (R,+oo) -» R+, R £ R+, a 
decreasing function and \ ■ \ any norm on M. d . If 

J f(cx)-^dP(x) < +00 (3.33) 

for some c > 1. Then X has an (r, r + ^-distribution. 

(b)Ifd = 1 and ifsupp(P) C [Rq, +00) for some Rq £ Wand f,, R i +oo ^ is decreasing for R Q > Ro. 
If further Assumption 33\i holds for some c > 1. Then X has an (r, r + ^-distribution. 
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The following criterion works for distributions with non radial tails. 



Criterion 3.3. Let r > 0, v G (0, d), P = f ■ \d and J \x\ r+v P(dx) < +oo for some r/ > 0. 
Assume that supp(P) is convex and that f satisfies the local growth control assumption : there exists 
real numbers e > 0, rj G (0, 1/2), M, C > such that 

Vx, y G supp(P), \x\ > M, \y - x\ < 2r]\x\ => /(y) > C/(x) 1+e . 

// 

(r+^)(l+e) 

/(x) dP(ai) < +oo, (3.34) 



then X has an (r, r + u)-distribution. 

Furthermore a necessary condition for X (with density /) to have an (r, r + 1/) -distribution is (see 
0): 

/" (r+i/) 

X has an (r, r + v) -distribution =^ / f{x) d + r dP(x) < +oo. (3.35) 



It follows from (j3.33 [) and (|3.35j) that the Gaussian distribution, the Weibull and the Gamma 
distributions have an (r, r + ^-distribution if and only if v G (0, d). The Pareto distribution with 
index 7 > r has an (r, r + ^-distribution if and only if v G (0, ^y)- 

Now, suppose that X has an (r, r + u) -distribution for some u G (0, d) and set 

v\ := sup{u > s.t. X has an (r, r + z^) -distribution}. 

Note that 

{v > s.t. X has an (r, r + v) -distribution} = (0, u* x ) or (0, v* x ] 
and that X G L r+iy (P), Vi/ G (0, u x ). When 

> s.t. X has an (r, r + z^) -distribution} = 

we set u x = 0. 

This index v x will play a crucial role to determine the lower bound of the maximal radius se- 
quence. Recall that if X has a density / satisfying A<f (/ > 0) = +00 then a necessary condition for 
X to have (r, r + z/)-distribution is that v < d. Which means that v x < d. However, this inequality 
may stand strictly as approved by the Pareto distribution with index 7 for which v\ = < 1 . 

We present below two different approaches to get the lower bound for the maximal radius se- 
quence. The first one involves the generalized survival functions F r like for upper bounds and is 
based on tail estimates. The second one is probably more original. It is based on random quantiza- 
tion: its specificity is to provide a close connection between the sequence (p n ) n >i and the maximum 
of the norm of an i.i.d sequence of random variables with distributions P. 



3.2.1 Distribution tail approach 

The main result of this section is the following theorem. 

Theorem 3.2. Let r > and let X be a M. d -valued random variable with probability distribution P. 
Let (a n )n>i be an L r (P) -optimal sequence of n-quantizers. For every v G (0, u x ), the following 
statements hold: 
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(a) 

limsup sup ( c r+v n~s~ F(p n + c) J < +00. (3.36) 

n— >+oo c>0 * ' 

(b) 

limsup sup ( (1 — 1/u) n d F r (up n )j < +00. (3.37) 

n— >+oo u>l ^ ' 

Proo/. (a) Let c> and let v G (0, Then 

E|X - X a "| r+ ^ > E ( min \X - a\ r+u l. 
On the events {\X\ > p n + c}, we have: \X\ > p n + c> p n > \a\, Va G a n . Then 
E | X _X^r^ > E(miu|X-ar +1 'l { | X | >Pri+c} ) 

- E (x " H) r+iyi {w>P"+c} 

> E((|X|- /0n ) r+ %| X | >Pn+c} ) (3.38) 

> c r+ »F({\X\ > Pn + c\). 

It follows that 

E\X-X a ™\ r+u > sup(c r+l/ P({|X| > Pn + c})) . 
00 

Since X has an (r, r + u) -distribution we have 

limsup n^r\\X - X an \\ r r Xl < +00. 

n 

Which completes the proof. 

(b) is proved like (a). Inequality (|3.38p becomes: for every u > 1, 



E\X- X^ > E [{\X\ - Pn ) r+V l { \x\>u Pn} ) > E (\Xr» (1 - l/u) r+ "l m>upn} ) . 
Then, 

E|X - X a *r v > sup [(1 - l/u) r+ »E(\Xrn m>upn} )] . 

U>1 

Inequality (|3.37|) follows by noticing that limsup n~d~ \\X — X an \\r+u < +00. □ 

Like for the upper estimate, given the asymptotic inverse function of — log F or given an asymp- 
totic lower bound ip of <fi satisfying some standard hypotheses specified below, we provide the asymp- 
totic lower estimate for the maximal radius sequence for distributions with exponential tails. 

For distributions with polynomial tails, we will rather look for the asymptotic inverse function 
^r,v, v G (0, v*x) an y) °f ~~ 1°§ Fr+v(e x ) or some asymptotic lower bound of it to provide a lower 
estimate of log p n . 
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Proposition 3.4. Let r > and let X be an M. d -valued random variable with distribution P. Suppose 
that X has an unbounded support. Let (a n ) n >i be an L r (P)-optimal sequence of n-quantizers. 

(a) If ip is a measurable nondecreasing function going to +00 as x — > +00, regularly varying with 
index 5 and satisfying 

ip(-logF(x)) < x + o(x), (3.39) 



then 



liminf — £H— > ( r -±^) . (3.40) 

^(log(n)) V d J 



n— »+oo 



If — log F is regularly varying of index 1/5 then (\3.40\i holds with ip = (— log F)^. 

(b) Let v G (0, u* x ). If there is a measurable nondecreasing function ip r)U (x) going to +00 as x ^ 
+00, regularly varying with index 5 and satisfying 

il>rA~ lo § Fr+u{e x )) <x + o(x), (3.41) 

then 

liminf . l ° ( f Pn ( „ >( r -^-) 5 - (3-42) 

In particular if — log F r+U (e x ) has regular variation with index 1/5 then <\3.42\i holds with ip r ,u(x) = 
(-logF r+i ,(e*)r. 

Let us provide a few comments on this proposition. We have for every r > and for every 
x > 0, F r (x) > x r F(x). Then 

— logF r (x) < — \ogF(x) — rlog(x). 

According to the nondecreasing hypothesis on ip we have for every x > 1 

ip(— logF r (x)) < ip(— logF(x) — r log(x)) < tp(— logF(x)) 

so that if p.39p holds then for every r > 0, for every v G (0, v\\ 

ip(- log F r+U (x)) < x + o(x). 

Reproducing the proof (given below) of Proposition 13.41 (a) by using ()3.37p instead of (|3.36p shows 
that (|3.40|) still holds true. This means that for distribution with exponential tails, the function ip do 
not depend on r and v even if in Assumption (|3-39|) we take the generalized survival function F r+V in 
place of the regular survival function F. However for distributions with polynomial tails like Pareto 
distribution the function ip rv in (|3.4ip may depend on r and taking the regular survival function 
F in place of the generalized survival function F r+V would make lose the dependance upon r and 
consequently lead to a less accurate result. 
We next prove the proposition. 

Proof, (a) Assume v* x > and let v G (0, u x ). It follows from (|3.37p that for large enough n, 

- log F(p n + c) > - log(a, c ) + T -^- log(n) 
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where C UtC is a positive real constant depending on the indexing parameters. It follows from the fact 
that ip is nondecreasing and goes to +00 and from Assumption ()3.39p that 

c o( Pn )Y l ^(^log(n)-log(a, c )) 



> 1 + — + 



V>(log(n)) V Pn Pn J V(log(n)) 
Since tp is regularly varying with index 5 we have 

liminf Pn > , Vi/€ (0,i£). 

n-++oo -0(log(n)) \ a J 

Letting v — > v* x give the announced result. If v% = 0, one follows the same proof with v = 0. 

(b) This is proved like the statement (b) in Proposition 13.31 by considering F r+V instead of F r , for 

The next criterion is the lower limit counterpart of Criterion 13. II 

Criterion 3.4. (a) Let X be an M. d -valued random variable with unbounded support and suppose 
that 

6* = inf {6 > 0, liminf e 0x >(|X| > x) > 0} G (0, +00]. (3.43) 



Then 

lim inf ^— > ( r -±pt) ^ . (3.44) 

n ^+°° (log(n)) 1/K V d0* J 

(b) Let X be a random variable with unbounded support such that v x > 0. Set 

C* = inf {C > 0, Vi/ G (0, vt), liminf x c ' r ~ u F r+ Jx) > 0) G [r + ut, +00]. (3.45) 

Then 

\ogp n 1 r + v* x 
liminf - — — > — . (3.46) 

n~*+oo log(n) Q± — r — v x a 

Proof, (a) Let 6 G (0*, +00). Then 

F(x) > C e- dxK 
for large enough x and for a positive real constant C. Therefore 

-logF(x) < 6x K (l - 

x R 

so that by setting tp$(x) = {x/6) l / K we have 

log F(x)) < x + o(x). 

It follows from Proposition 13.41 (a) that 

liminf — —r- > , , ( 



Oog(n)) 1/K ® 1,K V d 
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We let 9 go to 9* to get the announced result. 

(b) Let C G (0, C*). We have, for every v G (0, v* x ), 

Fr+ V (x) > C u x^ +r+u 

for large enough x and for a positive real constant C v . Then, by setting tp r ^(x) = ( ^_ X T _ V we get 
ip r ,v{— log F r+U (x)) < log(x) + o(\og(x)). It follows from Proposition 13.41 (b) that 

. logp„ 1 r + z; 
hmmf - — — > — . 

n— >+oo log(n) ^ — r — ^ a 

The right hand side of this last inequality is increasing on (0, v x ) (as a function of v) and on (0, C*) 
(as a function of so that by letting v and £ go respectively to v x and we get 

liminf^> 1 r + ^ 



n-»+oo log(n) — r — v x d 



□ 



Corollary 3.2. (a) If the density function of X reads 

fix) oc |x| c e^ 1 " x e M d ; tf, k > 0; c> -d (3.47) 
f/ierc v x = d and 9* = 9* = In this case we have for every r > 0, for every d > 1, 

1 ( i + L) 1/K < ii m i n f ^1L^ < i imsup Pj^^ < _jL / r\ V* 



#i/k\ d J (l g(n)) 1/K «-+oo (log(n)) 1/K V 

When d = 1 and r > 1 we /lave 

Pn I r + 1 x 



lim = — — . (3.49) 

"^ +0 ° (log(n)) 1/K V # / 

(b) IfX has a density f satisfying 

fix) oc SggML i.. . , ieK d j6l,c>r + d (3.50) 

then v x = d (l — *-~) £ (0, d) and = (,* = c — d. Furthermore we have for every r > and 
every d > 1, 

lu ? i — rr = 3 j • ( 3 - 51 ) 

n-»+oo log(n) c — r — d d 

(c) The claim $3.48\i ( resp ()J.5ip J holds if the density ofX is simply equivalent to the specified density 
in fi3A7\ (resp in (TOO)) ). 
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Proof . (a) It is obvious from Criterion |X2] and (|3.35p that v* x = d. Let K be the normalizing positive 
real constant in (|3.47p . We have for every x > 0, 



P(|X| >x) = K [ \u\ c e-^ K d\ d (u) 

J{\u\>x} 
C+oo 

= K d p c+d ~ l e-^dp 

J X 



C+d—K 

= K d—g- e-^(l + o(l)) 

where we used an integration by parts and usual integral comparison criterions. Consequently, if 
> i? then 

liminf e 8xK F{\X\ > x) = +00 

X— > + oo 

and if 6 < 1? then 

liminf e ex >(|X| > x) = 0. 

a;— »+oo 

Which means that 9* = 1? and the statement (|3.48p follows from Criterion 13.11 (a) and Criterion 
[371(g). 

(b) Let c > 1. We have 



'{/>o} -mN>i} l u l Jl P 

with d = c(l — £rg). -Kd,g is some positive real constant and /?' G K. We deduce that if 



d > d <j=> v < d ( 1 - r + ) then / /(ex) ^ f(x)d\ d (x) < +00 
V c / i{/>0| 



and if 



/ r + d\ / 
c < d <S=^ 1/ > d 1 then / /(x) r + d f(x)d\d{x) = +00 

V c / J{f>o} 

so that (from Criterion [321 and Statement ([335]) ) i/£ = d (l - ^) . 

Let us show that = c — d. For every r > 0, for every 1/ G (0, for every x > 1, integrating 
by parts and using integral comparison criterions yield 



F r+u (x)=E(\Xm m>x} ) = [ ^Mf- d \ d {u) 

J{\u\>x} \ u \ 

(logpf 



V d / ^r^dp 
Jx P c 

V d ^-\og{xf{l + o{l)) 
c — 1 



with d := c — r — v — d + 1 > 1. It follows that 



^- r ~ v F r+u {x) = -p-log{xfxt- c+d {l + o(l)) 
d — 1 
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so that for every r > and for every v G (0, z/y), if £ > c — d then 

liminf x ( '~ r ~ u F r+U {x) = +00 

x— >+oo 

and if £ < c — d, 



Uminf x c_r_, 'F r+l/ (x) = 0. 

x— »+oo 



Hence = c— d. It follows from Criterion 13. 11 (b) and Criterion 13.41 (b) that 

1 r + z/£ . log|O n log/? n 1 r + d 

< hmml - — — — - < limsup - — p— r < 



C* — ' — d n-»+oo log(n) n-»+oo log(n) c — r — d d 

Now (recall that C* = c - d and v\ = d (l - ^f) ) , 

1 r + i/y 1 r + d 

— r — v\ d c — r — d d 

so that 

lim ^g/ 9 " _ 1 r + d 
n— >+oo log(ra) c — r — d d 

(c) Obvious from what forgoes. □ 

We deal now with examples. 
Example 3.2. (1) It follows from Corollary (a) that 
• When X ~ A/"(0, Id), for every r > 0, for every d > 1, 



< liminf -£= < limsup ^= < 2,3^. 
d y^log(n) y/\og(n) V d 

In case d = 1 and r > 1 we have 

lim = v^+l). 

n ^+°° ^/log(n) 

• If X ~ T(a, A), a > 0, A > or if X has a double gamma distribution we have for every r > 1, 

n->+oolog(n) A 



(which coincides to the exact rate given in (|3.3p for the exponential distribution) and for every r € 
(0,1), 

r + l p n p n 2(r + 1) 
— - — < lim mi - — — < limsup - — — < . 

A rw+oo log(n) „^+oo log(n) A 

• As concern the logistic distribution, the maximal radius sequence has the same asymptotic as the 
exponential distribution with parameter A = 1 following Corollary 13.21 (c). 

• For a Weibull distribution with shape parameter k > we have for every r > 1, 

n ^ + °°(log(n)) 1/fc 
27 



For r G (0, 1), one has 

( r + i)V" < Hminf pn < Jim sup Pn -, < 2(r + 

n ^+°° (log(n)) 1/K ™-+°° (log(n)) 1/K 

(2) Suppose X is a random variable having a Pareto distribution with index 7 > r where the density 
reads f(x) = 7:r _ ( 7+1 )l{ a , >1 }. It follows from Corollary 13.21 (b) (with c = 7 + 1, d = 1) 

logp n r+1 
lim - — ^ = . (3.52) 

n->+oo log(n) 7 — r 



We retrieve of course the sharp rate given in (|3.4|h 

3.2.2 An alternative approach by random quantization approach 

Let X ~ P. Random quantization is another tool to compute the lower estimate of the maximal 
radius sequence. It makes a connection between p n and the maximum of an i.i.d sequence of random 
variables with distributions P. 

Theorem 3.3. Let r > and let X be a random variable taking values in M. d with probability 
distribution P with P a 7^ 0. Assume (a n ) n >i is an L r (P)-optimal sequence of n-quantizers. Let 
{Xk)k>i be an i.i.d sequence of M^-valued random variables with probability distribution P. For 
every v € (0, v* x ), 

liminf (p n — E( max l^fcl)) > ~~ C u (3.53) 

n->+oo v v k<[n( r +' J )/ d ] 

where C v is a positive real constant. 

Proof. Let v G (0, v* x ) and let X% n = E aean al{x k& c a (a n )}- We have, 
p n > max I X™ n I 

k<m 
m 

- Dmax|Xf"|l { | Xfc | >max . #fc | X .| } 

k=l ~ m 

m 

- ^2\ X k"\ l {\X k \>m^ k \Xi\} 
k=l 

m 

^ ^2 (l Xfc l ~ \ Xk - Xk'DMlXk^m^k^} 
k=l 

m 

> Emax|X fc |-^E(|X fc -X^|l { | Xfc | >maXi ^| Xi | } 

~ m k=l 

Furthermore, 

Vk > 1, \X k - X^ n \l{\ Xk \ >msx .^ k \ X .\} = \Xi -X" n |l { | Xl | >maXM1 | Xi |}. 

Hence, 

Pn > Emax|X fe | - mE (\X X - Xf n \l { \ Xx>m ^ 1 \ Xl \}) 

/ \ 1-1/ (r+u) 

> Emax|X fc | - m \\Xt - X? n \\ r+U P(|Xi| > max|X;|) 

k<m V i^l J 
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Since the events 



{\X k \ > max|Xj|}, k= l,--- , m 



are pairwise disjoint with the same probability we have 

P(|Xi| > msx\Xi\) < — . 

Finally, 

p n > E max |Xj.| — ||Xi — 

k<m 

It follows, by setting m = [n^ r+u ^ d ], that 

liminf (p n — E( max |^fc|)) > — limsupn^ — Xf™|| r+l ,. 

n-H-oo fc<[n( r + y )/ d ] n->+oo 

However, since X has an (r, r + i/) -distribution, the upper limit on the right hand side of the equation 
is finite. □ 

Example 3.3. (Exponential distribution) Let r > and let X be an exponentially distributed random 
variable with parameter A > 0. If (a n ) n >i is an L r -optimal sequence of n-quantizers for X then 
Theorem 13.31 implies 

liminf ^1L_ > Z+l. (3.54) 

n-»+oo log(n) A 



which is the sharp rate given by (|3.3|) . 

Indeed, let v G (0, v* x ) and let pQ){ i=1 .... [ R r+»n, be an i.i.d exponentially distributed sequence 
of random variables with parameter A. We have for every u > 0, 

P( max Xi>u) = l- F(X X < u)^' + ^ = 1 - F(u)^ r+ "\ 
i<[n r +"] 

where F is the distribution function of X (we will denote by / its density function). Then 

E( max Xi) = / P( max X { > u)du = / (1 - (1 - e~ Xu ) [nr+v] )du 
i<[n r + v ] J i<[n r + v ] Jo 

+00 K+1-i 

= / ( Huy)e- Xu du 



1/1 1 
-(! + - + ••• + 



A v 2 [n^j ' 

> il g(l + [„»■+"]) > ^log(n). 

Consequently, it follows from the super-additivity of the liminf that for every v £ (0, 1), 

,. . . pn ^ .. , ,Pn. -E(max i<[n r+,]) _ E(max !<K+ ,]) 
lim mi - — — — - > lim mi — — — h lim mi — — — 

n-H-oo log(n) ra— ►+00 log(n) n-»+oo log(n) 

r + v 
~ A ' 

The result follows by letting v go to ^ = 1. 
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Example 3.4. (Pareto distribution) Let X be a random variable having a Pareto distribution with index 
7 > 0. If (a n ) n >i is an asymptotically L r -optimal sequence of n-quantizers for X, r is such that 
7 > r, then Theorem [33] yields 

log/On r + 1 
iimml - — j—r > . 

n-»+oo log(n) 7 + 1 

Which is not the sharp rate given by (|3.4p . 



Notice that if 7 > r then X £ L r+V (¥) for i) £ (0,7 — r). Now, to prove this result, let u € 
(0, i/y) and let (Xj)r i=1 ... for+wn be an i.i.d sequence of random variables with Pareto distribution 
with index 7. We have 

Vm>l,Vu>l, F(maxXi < u) = (1 - u _7 ) m . 

Then, the density function of maxi<j< m X{ is m7u~( 7+1 )(l — n~ 7 ) m_1 . 
Hence 

r+00 

E( max X 4 ) = ?n 7 / x" 7 (l - aT 7 ) m_1 dx 

l<i<m 

= m C u^{l-u) m - l du (u = x" 7 ) 



1 

^0 

= m £>(1 , m) 

7 

r(i-i)r(m + i) 

T(m + l-i) 

1 1 

~ r(l ) m~i as m — ► +00 

7 

where we used Stirling's formula for the last statement. We finally set m = [n r+u ] to get 

E( max Xi) ~ IYI - -) n^. 

V l<i<K+"] 7 7 

It follows from 0331 that for every e £ (0, 1), 

p n -{l- e)T{l - -) ri^T > -e - C u . 
7 

r-\-v 

Dividing both side of the inequality by n t and taking the logarithm yields 

log p n - log(n) > log f (1 - e)r(l --)-{£ + C u )n~T 

7 V 7 



Consequently 



. t logp n r + v 
hmmt - — ^— > 



n— >+oo log(n) 7 

for every v G (0, i/y). The announced result follows by letting z/ go to z/^ = 
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Comment. Let <fi be the inverse (if any) function of — log F. It can be noticed that in both previous 
examples we have 

E( max \X k \) ~ 0((r + ^)log(n)) as n -> +00 (3.55) 

k<[n r+v *x] 

which, for distributions with exponential tail leads to the same asymptotic lower bound for the se- 
quence (p n )n>i as m (|3.40[) . For Pareto distribution, using the approximation ()3.55[) to compute 
the asymptotic lower estimate of the maximal radius sequence make us loose the " — r" term in the 
exact asymptotic. To recover this reminding term we have simply to consider the inverse function 
of — log F r+V * x (as done in the previous section) instead of — log F, and, the random quantization 
approach clearly does not allow us to do so. 

3.2.3 A conjecture about the sharp rate 

The previous results related to distributions with exponential tails strongly suggest the following con- 
jecture: suppose X is a distribution with exponential tail in the sense of claim (|3.43p . Then for every 
r > 0, for every d > 1, 

,. Pn fr + d 

lim —r- = — — — 

(l g(n)) 1/K V d6* 

This conjecture is proved for d = 1 and r > 1. To be satisfied for hight dimension we need to 
proof that the geometric statement ()3.9[) of Lemma l3~T1 (a) holds true with " 1 + e" instead of " 2 + e" 
like in 1-dimension. Although this inequality looks quite intuitive in any dimension its proof seems 
out of reach when d > 2. 

3.2.4 Numerical experiments 

We now attempt to focus on numerical experiment of the maximal radius sequence (p n ) n >i for the 
quadratic optimal quantizers of the Gaussian, the Weibull and the exponential distributions. A whole 
package of quadratic optimal n-quantizers of the J\f(0, Id) distributions are available in the website 

www . quantize . maths-f i . com 

for d £ {1, • • • , 10} and n € {1, • • • , 5000}. When d = 1, these L 2 -optimal grids are obtained by 
the Newton method, see e.g. [10] for details. For the exponential distribution the quadratic optimal 
quantizers are computed by using the semi-closed formulae given in Proposition 13.11 

As concern the Weibull distribution with shape parameter k = 2, we compute the quadratic 
optimal quantizers up to 3000 using the Lloyd's I algorithm described in [10] (see [8] for a more 
itemized description of the algorithm). 

In these three cases we depicted the ratio between p n and the expected asymptotic optimal rate. 
For the exponential distribution we represent the graph of 3 ^{n) as a mnct i° n °f the grid sizes 
(see Figure d). One remarks that the convergence of 3 to 1 as n goes to infinity is almost 
instantaneous. 

However, the cases of the Gaussian and the Weibull distributions are more delicate. Indeed, for 
the Gaussian distribution the ratio , pn seems increasing but has not reached yet the value 9 even 

V 6 log(n) 

for a grid size equals 100000, as emphasized by Figure Q] (right hand side graph). For the Weibull 

distribution, , pn also seems increasing but takes values around 0.927 for a grid size equal to 

V 31 °s( n ) 

3000 (see Figure 2). Then for both cases, the convergence to 1 of the ratio between the maximal 
radius and the expected asymptotic optimal rate seems increasing but very low. 
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Figure 1: Left: 



31og(n; 



as a function of the grid size n for the exponential distribution. Right: 



function of the grid size n for the normal distribution. 



V 61 °g( n ) 



Weibull disffibution 



1500 
grid size 



Figure 2: , p " as a function of the grid size for the Weibull distribution with shape parameter k = 

A/31og(n) 
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